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Abstract 

Exome sequencing was used as a diagnostic tool in a Roma/Gypsy family with 
three subjects (one deceased) affected by lissencephaly with cerebellar hypopla- 
sia (LCH), a clinically and genetically heterogeneous diagnostic category. Data 
analysis identified high levels of unreported inbreeding, with multiple rare/novel 
"deleterious" variants occurring in the homozygous state in the affected indi- 
viduals. Step-wise filtering was facilitated by the inclusion of parental samples 
in the analysis and the availability of ethnically matched control exome data. 
We identified a novel mutation, p.Asp487Tyr, in the VLDLR gene involved in 
the Reelin developmental pathway and associated with a rare form of LCH, the 
Dysequilibrium Syndrome. p.Asp487Tyr is the third reported missense muta- 
tion in this gene and the first example of a change affecting directly the func- 
tionally crucial /?-propeller domain. An unexpected additional finding was a 
second unique mutation (p.Asn494His) with high scores of predicted pathoge- 
nicity in KCNV2, a gene implicated in a rare eye disorder, retinal cone dystro- 
phy type 3B. This result raised diagnostic and counseling challenges that could 
be resolved through mutation screening of a large panel of healthy population 
controls. The strategy and findings of this study may inform the search for new 
disease mutations in the largest European genetic isolate. 
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A Roma/Gypsy family with three subjects (one deceased) 
(Fig. SI) affected by a defect in brain development was 
referred for diagnostic investigations. The clinical features 
(Table SI) included global developmental delay, moderate 
to severe intellectual deficit, nonprogressive severe truncal 
ataxia, dysarthric speech, gaze-evoked nystagmus, mild 
intentional tremor, and pyramidal signs. Neuroimaging 
(Fig. S2) showed global small brain, pontocerebellar 
hypoplasia, and mild to moderate cortical thickening with 
gyral simplification more pronounced in the frontal and 
temporal regions. The phenotype was classified broadly as 
lissencephaly with cerebellar hypoplasia (LCH), a hetero- 
geneous diagnostic category of cortical malformations 
where some patients have defects in the Reelin neuronal 
migration pathway but a significant proportion of 
cases remain unexplained (reviewed in Ross et al. 2001; 
Barkovich 2012). LCH genetic heterogeneity prompted us 
to choose exome sequencing as an efficient diagnostic 
approach. The analysis included the two living patients 
and one set of parents (Fig. SI). Written informed con- 
sent was obtained from the parents; the study complies 
with the ethical guidelines of the institutions involved. 

Exome capture (lUumina TruSeq) and sequencing 
(lUumina HiSeq 2000, lUumina Inc., San Diego, CA) were 
performed by Axeq Technologies (Seoul, South Korea). 
After initial quality control, data analysis included align- 
ment to the hgl9 reference genome (Li and Durbin 

2009) , variant calling in SAMtools (Li et al. 2009) using 
default parameters, and identification of variants in 
dbSNP135 (http://www.ncbi.rLlm.nih.gov/projects/SNP/). 
Variants were annotated using ANNOVAR (Wang et al. 

2010) version 23 October 2012, and ANNOVAR-formatted 
databases based on the UCSC Known Gene ("hgl9_known- 
Gene"), the 1000 Genomes project (http://www.1000 ge- 
nomes.org/) ("hgl9_ALL.sites.2012_02"), and the NHLBI 
Exome Sequencing Project (http://evs.gs.washington.edu/ 
EVS/; "hgl9_esp6500_all"). From the exome sequencing 
data, a set of 5521 polymorphic markers (intermarker dis- 
tance >0.15cM, in approximate linkage equilibrium, 
average heterozygosity 0.42) were extracted (Smith et al. 

2011) and used to estimate inbreeding coefficients (Leute- 
negger et al. 2003). The search for the disease-causing 
mutation focused on rare variants (<1% in public data- 
bases) including nonsense, exonic indels, affecting splicing 
sites, and missense variants predicted to be deleterious by 
Polyphen2 (Adzhubei et al. 2010) (ANNOVAR database 
"hgl9_ljb_pph2") and SIFT (Ng and Henikoff 2001) 
(ANNOVAR database "hgl9_avsift"). 



In contrast to the reported genealogy, inbreeding analy- 
sis revealed close parental consanguinity (Fig. SI) which, 
together with the pedigree structure suggesting autosomal 
recessive inheritance, led us to assume autozygosity for a 
rare/unique deleterious variant. Out of a total of 63,000- 
68,000 variants present in each affected subject, our step- 
wise filtering strategy (Fig. S3) identified ca. 500 rare "del- 
eterious" changes (0.73% of all variants) that were homo- 
zygous in each patient, including 309 shared by both 
patients. The final filtering criteria required heterozygosity in 
the parents and no homozygosity among control Roma exo- 
mes. This left two novel missense mutations in neighboring 
genes on chromosome 9p24: a G>T (hgl9 chr9:2645720; 
RefSeq NM_003383.3, exonlO: C.1459G>T; NP_003374.3: 
p.(Asp487Tyr)) in VLDLR and an A>C (hgl9 chr9: 2729569; 
RefSeq NM_133497.3, exon2: c.l480A>C; NP_598004.1: 
p.(Asn494His)) in KCNV2 (Fig. la). The predicted amino 
acid substitutions were nonconservative: from the acidic 
polar Aspartic acid to the aromatic nonpolar Tyrosine in 
VLDLR and from the neutral polar Asparagine to the basic 
polar Histidine in KCNV2. Both affected evolutionary 
conserved positions, with deleterious effects predicted with 
very high probability by PolyPhen-2 and SIFT (Fig. lb 
and c). Both genes have been implicated in rare Mendelian 
disorders: VLDLR (very low-density lipoprotein receptor) 
- cerebellar ataxia, mental retardation, and disequilibrium 
syndrome 1, CAMRQl, MIM#224050 and KCNV2 (volt- 
age-gated potassium channel subunit Kv8.2) - retinal cone 
dystrophy, RCD3B, MIM#6 10356. 

The p.Asp487Tyr mutation in VLDLR could explain 
the neurological phenotype, classifying the affected indi- 
viduals as VLDLR-associated Dysequilibrium Syndrome 
(DES), a rare condition with eight disease-causing (two 
missense) mutations reported to-date (Boycott et al. 
2009; Kolb et al. 2010; Ali et al. 2012). The VLDL recep- 
tor is part of the Reelin developmental pathway, orches- 
trating the migration of glutamatergic neurons into 
cortical layers, the alignment of pyramidal neurons in the 
hippocampus, and the dispersal of Purkinje cells in the 
cerebellum (D'Arcangelo et al. 1995, 1999; Trommsdorff 
et al. 1999). Reelin signaling is regulated through inter- 
nalization and rapid uncoupling of the ligand from the 
VLDL receptor due to conformational changes at endoso- 
mal pH, whereupon the ligand is targeted for lysosomal 
degradation and the receptor is recycled to the cell mem- 
brane (Fig. 2) (Duit et al. 2010; Reddy et al. 2011). The 
p.Asp487Tyr mutation can be predicted to disrupt the jS- 
propeller protein domain, shown to be essential for ligand 
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Figure 1. Two unique missense variants identified by exome sequencing in tine affected family, (a) Integrative Genomics Viewer snapsinot of the 
short reads aiignment from the exome sequencing (upper panel) and confirmatory Sanger sequencing (lower panel); left G>T (hg19 chr9:264B720) in 
VLDLR, right A>C (hg19 chr9:2729569) in KCNV2. (B, C) UCSC and IVIultalign (Corpet 1988) analysis of the evolutionary conservation of VLDLR 
Asp487 (b) and KCNV2 Asn494 (c) interspecies (upper panel) and protein family members (lower panel) comparisons. The VLDLR mutation affects a 
strictly conserved amino acid residue in the consensus repeat motif of the second blade in the /?-propeller structure of the protein. The deleteriousness 
prediction scores in PolyPhen-2 equaled 1 .00 for both mutations; SIFT scores were 0.00 for the VLDLR and 0.05 for the KCNV2 change. VLDLR, very 
low-density lipoprotein receptor; UCSC, University of California, Santa Cruz, genome browser; SIFT, sorting intolerant from tolerant algorithm. 
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Figure 2. Mechanisms of action of the very low-density lipoprotein receptor (VLDL) receptor and potential pathogenic effects of the p.Asp487Tyr 
substitution, (a) In the consensus repeat motifs (Tyr-Trp-Thr-Asp), Asp residues serve as clasps between adjacent blades of the /J-propeller, 
stabilizing the structure by hydrogen-bonding with the backbone and Trp side chains (red dotted lines). The mutation is predicted to disrupt these 
interactions, leading to a misfolded /J-propeller. (b) The correctly folded receptor protein is targeted to the neuronal surface. Upon Reelin binding 
the complex is internalized, with conformational changes induced by the endosomal pH leading to dissociation and lysosomal degradation of the 
ligand, while the receptor is recycled to the cell membrane, (c) The misfolded mutant receptor may be retained in the endoplasmic reticulum and 
targeted for degradation, (d) Alternatively, correct membrane targeting and ligand binding are followed by lack of conformational changes at 
acid pH, impaired ligand release, and targeting the entire ligand-receptor complex for degradation. The ribbon diagram in (a) was constructed in 
PyMol (www.pymol.org) from the corresponding crystal structure of the YWTD repeat of the LDL receptor (PDB ID 1IJQ). Schematic in (b-d) 
adapted from Beglova et al. (2005). 



release and receptor recycling in the closely related LDL 
receptor (Rudenko et al. 2002; Beglova and Blacklow 
2005). The pathogenic effect may involve protein misfold- 
ing and impaired trafficking, as proposed for another 
VLDLR mutation, p.Asp521His (Boycott et al. 2009) or, 
alternatively, may interfere with ligand dissociation upon 
internalization (Fig. 2). 

In contrast to the VLDLR mutation, which was an 
obvious candidate accounting for the brain malformation 
and ensuing phenotype, the KCNV2 change was an unex- 
pected finding of unknown cUnical significance and coun- 
seling implications. Retinal cone dystrophy type 3B is a 
slowly progressing disorder of variable severity, whose 
diagnosis relies on specific electroretinographic findings 
(Robson et al. 2010). The sustained cooperation required 
during electroretinography was unachievable in our 
patients in view of their mental retardation, and no rele- 
vant information could be obtained from the care provid- 
ers, leading us to resort to mutation screening in 



ethnically matched controls as the feasible approach to 
plausibility assessment. A panel of healthy Roma controls 
from a range of subisolates (Kalaydjieva et al. 2005) was 
tested using custom-designed TaqMan® SNP Genotyping 
Assays (Applied Biosystems, Mulgrave, VIC, Australia) 
(Table S2). The VLDLR p.Asp487Tyr variant was not 
detected in 566 control subjects, suggesting that it is a 
private mutation confined to this consanguineous family. 
By contrast, the KCNV2 variant was very common across 
subisolates, with 101 carriers (14%) and 8 homozygotes 
(1.1%) identified among 721 controls. This unusually 
high frequency, with an improbable prevalence of -1/100 
of presumably affected individuals (under the assumption 
of complete penetrance) indicated that, contrary to 
bioinformatics predictions, the KCNV2 change was a 
polymorphism, not a pathogenic mutation. 

The population genetic characteristics of the Roma 
population, with strong founder effects, genetic drift, and 
limited diversity, have been described in previous studies 
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(reviewed in Kalaydjieva et al. 2005). What has become 
apparent from recent exome sequencing data is a surpris- 
ingly high level of inbreeding (this study and Guerguelt- 
cheva et al. 2012) that could be due to unrecognized 
consanguinity and the cumulative effects of historical 
endogamy and small population size. As a result, Roma 
exomes present with a large absolute number and 
proportion of all high quality exome variants per individ- 
ual of homozygous "deleterious" variants, significantly in 
excess of the proportion observed in outbred Caucasian 
samples available in-house (one sided f-test, unequal vari- 
ance, P = 4e"^ df= 4.548) (Table S3). The findings 
emphasize the need for custom-designed family- and pop- 
ulation-based approaches to diagnostic exome sequencing 
in inbred founder populations. In our study, filtering out 
the plethora of "candidate mutations" was made possible 
by the inclusion of parental data and comparison to other 
Roma exomes. The additional challenge of a second 
unique "pathogenic" mutation, not found in over 6500 
exomes in public databases, would have remained unre- 
solved if population-specific data were unobtainable or 
ambiguous, highlighting the medical and ethical dilemmas 
in this type of analyses and the need for ethnically 
matched controls, as well as for further improvement of 
bioinformatics predictions. 

Acknowledgments 

We are grateful to the affected family and control individu- 
als participating in this study. D. N. A. is supported by 
NHMRC Training Fellowship 634551. The study of D. N. 

A. , L. F., and L. K. is supported by Medical and Health 
Research Infrastructure Fund, Western Australia and by 
the Western Australian Institute for Medical Research. M. 

B. is supported by Australian Research Council Future Fel- 
lowship FT 100 100764 and NHMRC Program Grant 
490037. The study of M. B. and R. T. is supported by 
Victorian State Government Operational Infrastructure 
Support and the Australian Government NHMRC Inde- 
pendent Research Institute Infrastructure Support Scheme. 

Conflict of Interest 

None declared. 
References 

Adzhubei, 1. A., S. Schmidt, L. Peshkin, V. E. Ramensky, A. 
Gerasimova, P. Berk, et al. 2010. A method and server for 
predicting damaging missense mutations. Nat. Methods 
7:248-249. 

Ali, B. R., J. L. Silhavy, M. J. Gleeson, J. G. Gleeson, and 
L. Al-Gazali. 2012. A missense founder mutation in VLDLR 



is associated with Dysequilibrium Syndrome without 
quadrupedal locomotion. BMC Med. Genet. 13:80. 

Barkovich, A. J. 2012. Developmental disorders of the 
midbrain and hindbrain. Front. Neuroanat. 6:7. 

Beglova, N., and S. C. Blacklow. 2005. The LDL receptor: how 
acid pulls the trigger. Trends Biochem. Sci. 30:309-317. 

Boycott, K. M., C. Bonnemann, J. Herz, S. Neuert, C. 

Beaulieu, J. N. Scott, et al. 2009. Mutations in VLDLR as a 
cause for autosomal recessive cerebellar ataxia with mental 
retardation (dysequilibrium syndrome). J. Child Neurol. 
24:1310-1315. 

Corpet, F. 1988. Multiple sequence alignment with hierarchical 
clustering. Nucleic Acids Res. 16:10881-10890. 

D'Arcangelo, G., G. G. Miao, S. C. Chen, H. D. Soares, 
J. I. Morgan, and T. Curran. 1995. A protein related to 
extracellular matrix proteins deleted in the mouse mutant 
reeler. Nature 374:719-723. 

D'Arcangelo, G., R. Homayouni, L. Keshvara, D. S. Rice, 
M. Sheldon, and T. Curran. 1999. Reelin is a ligand for 
lipoprotein receptors. Neuron 24:471-479. 

Duit, S., H. Mayer, S. M. Blake, W. I. Schneider, and I. 

Nimpf. 2010. Differential functions of ApoER2 and very low 
density lipoprotein receptor in Reelin signaling depend on 
differential sorting of the receptors. ). Biol. Chem. 
285:4896^908. 

Guergueltcheva, V., D. N. Azmanov, D. Angelicheva, 

K. R. Smith, T. Chamova, L. Florez, et al. 2012. Autosomal- 
recessive congenital cerebellar ataxia is caused by mutations 
in metabotropic glutamate receptor 1. Am. J. Hum. Genet. 
91:553-564. 

Kalaydjieva, L., B. Morar, R. Chak, and H. Tang. 2005. A 
newly discovered founder population: the Roma/Gypsies. 
Bioessays 27:1084-1094. 

Kolb, L. E., Z. Arlier, C. Yalcinkaya, A. K. Ozturk, 
J. A. Mohterno, O. Erturk, et al. 2010. Novel VLDLR 
microdeletion identified in two Turkish siblings with 
pachygyria and pontocerebellar atrophy. Neurogenetics 
11:319-325. 

Leutenegger, A. L., B. Prum, E. Genin, C. Verny, 

A. Lemainque, F. Clerget-Darpoux, et al. 2003. Estimation 
of the inbreeding coefficient through use of genomic data. 
Am. I. Hum. Genet. 73:516-523. 

Li, H., and R. Durbin. 2009. Fast and accurate short read 
alignment with Burrows-Wheeler transform. Bioinformatics 
25:1754-1760. 

Li, H., B. Handsaker, A. Wysoker, T. FenneU, J. Ruan, 
N. Homer, et al. 2009. The Sequence Alignment/Map 
format and SAMtools. Bioinformatics 25:2078-2079. 

Ng, P. C, and S. Henikoff. 2001. Predicting deleterious amino 
acid substitutions. Genome Res. 11:863-874. 

Reddy, S. S., T. E. Connor, E. J. Weeber, and W. Rebeck. 2011. 
Similarities and differences in structure, expression, and 
functions of VLDLR and ApoER2. Mol. Neurodegener. 6:30. 



2013 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc. 



75 



challenges of Diagnostic Exome Sequencing 



D. N. Azmanov ef al. 



Robson, A. G., A. R. Webster, M. Michaelides, S. M. Downes, 
J. A. Cowing, D. M. Hunt, et al. 2010. "Cone dystrophy 
with supernormal rod electroretinogram": a comprehensive 
genotype/phenotype study including fundus 
autofluorescence and extensive electrophysiology. Retina 
30:51-62. 

Ross, M. E., K. Swanson, and W. B. Dobyns. 2001. 
Lissencephaly with cerebellar hypoplasia (LCH): a 
heterogeneous group of cortical malformations. 
Neuropediatrics 32:256-263. 

Rudenko, G., L. Henry, K. Henderson, K. Ichtchenko, 
M. S. Brown, J. L. Goldstein, et al. 2002. Structure of the 
LDL receptor extracellular domain at endosomal pH. 
Science 298:2353-2358. 

Smith, K. R., C. J. Bromhead, M. S. Hildebrand, A. E. Shearer, 
P. J. Lockhart, H. Najmabadi, et al. 2011. Reducing the 
exome search space for mendelian diseases using genetic 
linkage analysis of exome genotypes. Genome Biol. 12:R85. 

Trommsdorff, M., M. Gotthardt, T. Hiesberger, ]. Shelton, 
W. Stockinger, J. Nimpf, et al. 1999. Reeler/Disabled-like 
disruption of neuronal migration in knockout mice lacking 
the VLDL receptor and ApoE receptor 2. Cell 97:689-701. 

Wang, K., M. Li, and H. Hakonarson. 2010. ANNOVAR: 
functional annotation of genetic variants from high- 
throughput sequencing data. Nucleic Acids Res. 38:el64. 

Supporting Information 

Additional Supporting Information may be found in the 
online version of this article: 

Figure SI. Pedigree affected by lissencephaly with cerebel- 
lar hypoplasia. The family belonged to a young, strictly 
endogamous Roma/Gypsy subisolate structured into mul- 
tiple small clans (Kalaydjieva et al. 2005). The family 
reported a single consanguineous marriage at 3rd cousin 
level, between II-3 and II-4 (not shovm), in contrast to 
the high inbreeding coefficients estimated using FEstim 
and SNPs extracted from the exome sequencing data: 
0.038 (I-l), 0.062 (1-2), 0.05 (II-2), and 0.057 (III-l). Ex- 
ome sequencing identified two unique variants, in VLDLR 
and KCNV2, that satisfied all filtering criteria. 



Figure S2. Brain MRI (magnetic resonance imaging) of 
subject II-2. (A) Sagittal TIW and (B) Coronal Tl Inver- 
sion recovery demonstrating marked cerebellar hypopla- 
sia, particularly affecting the inferior hemispheres and 
vermis as a whole. The pons is also notably small. (C) 
Axial T2W showing mild to moderate cortical thickening 
(pachygyria) with simplification of the gyral architectural 
folding. 

Figure S3. Step-wise filtering of the variants identified by 
exome sequencing in the affected individuals. The search 
for the disease mutation was based on the assumption of 
a rare/unique variant homozygous in both patients, het- 
erozygous in the parents, and not homozygous in 25 
Roma exomes available in-house. * Primary quality con- 
trol retained for further analysis variants with quality 
scores > 20 and coverage >4x, excluding changes 
located in segmental duplications and simple repeats; 
^Variants defined as deleterious included nonsynonymous 
amino acid substitutions with a Polyphen2 score >0.8 and 
SIFT score < 0.05, splice-site (±15 nucleotides), nonsense 
and nonstop changes, as well as small in-frame or frame- 
shift insertion/deletions. 

Table SI. Clinical and neuroimaging findings in the 
affected subjects. 

Table S2. Primers, PCR conditions, and TaqMan® 
probes used in the analysis of VLDLR c.l459G>T and 
KCNV2 c.l480A>C. (A) Primers and PCR conditions for 
fragment amplification and Sanger sequencing. (B) Taq- 
Man ® assay primers and probes used in the population 
screening. 

Table S3. (A) Proportion of homozygous "deleterious" 
variants relative to all high-quality variants observed in 
individuals from the Roma family studied and in 28 out- 
bred exomes after filtering. (B) Boxplot of homozygous 
"deleterious" variants relative to all high-quality variants 
observed in individuals from the Roma family studied 
and in 28 outbred exomes after filtering. The circle repre- 
sents an outlier, as determined by the "boxplot" function 
in the statistical software R (http://www.r-project.org/). 
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